
examples/dreambooth: fix missing weighting chunk when using prior preservation in Flux and SD3 LoRA training #13743

Open
Dev-X25874 wants to merge 2 commits into huggingface:main from Dev-X25874:fix/dreambooth-prior-preservation-weighting-chunk

Conversation

@Dev-X25874
Contributor

What does this PR do?

When --with_prior_preservation is enabled, the training batch concatenates
instance and class (prior) samples, so every per-sample tensor —
model_pred, target, sigmas, and therefore weighting — has shape
(2 * train_batch_size, ...).
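
As a quick illustration of that batch layout (the latent shape here is illustrative, not the scripts' actual dimensions):

import torch

B = 2  # train_batch_size (illustrative)
instance_latents = torch.randn(B, 16, 64, 64)   # instance samples
class_latents = torch.randn(B, 16, 64, 64)      # class (prior) samples

# The collated batch stacks instance samples first and class samples second,
# so every per-sample tensor derived from it inherits the 2*B leading dim.
latents = torch.cat([instance_latents, class_latents], dim=0)
print(latents.shape[0])  # 2 * B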

Inside the loss block, model_pred and target are correctly split via
torch.chunk(..., 2, dim=0), but weighting was never chunked. This means:

  • weighting (size 2B) is broadcast against model_pred_prior and
    target_prior (size B). With train_batch_size == 1 this silently
    produces a loss tensor of the wrong shape and applies incorrectly
    paired timestep weights to the prior loss term; with a larger batch
    size the broadcast fails outright with a runtime shape error (see
    the shape sketch after this list).
  • The instance loss term also gets weights from the full unsplit weighting
    instead of only the instance-sample half.
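
To make the mismatch concrete, here is a minimal, self-contained sketch of the bad broadcast, using the SD3-style (B, C, H, W) layout described above (the values are illustrative):

import torch

B = 1  # per-device train_batch_size; with B == 1 the bad broadcast succeeds silently
C, H, W = 4, 8, 8

weighting = torch.rand(2 * B, 1, 1, 1)       # never chunked: still size 2B on dim 0
model_pred_prior = torch.randn(B, C, H, W)   # prior half after torch.chunk
target_prior = torch.randn(B, C, H, W)

bad = weighting * (model_pred_prior - target_prior) ** 2
print(bad.shape)  # torch.Size([2, 4, 8, 8]): 2B on dim 0 instead of B

# With B > 1 the same expression raises a RuntimeError instead of silently
# mis-weighting, since dim 0 sizes 2B and B cannot broadcast.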

The correct pattern already exists in train_dreambooth_lora_flux2.py:

weighting, weighting_prior = torch.chunk(weighting, 2, dim=0)

This PR applies the same fix to train_dreambooth_lora_flux.py and
train_dreambooth_lora_sd3.py, which were both missing it.
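
For reference, a sketch of the corrected loss block under prior preservation, mirroring the pattern in train_dreambooth_lora_flux2.py (the exact surrounding code in the two fixed scripts may differ slightly):

if args.with_prior_preservation:
    # The batch stacks instance samples first, class samples second,
    # so chunking along dim 0 separates the two halves.
    model_pred, model_pred_prior = torch.chunk(model_pred, 2, dim=0)
    target, target_prior = torch.chunk(target, 2, dim=0)
    # The fix: split weighting the same way so each loss term only sees
    # the timestep weights of its own half of the batch.
    weighting, weighting_prior = torch.chunk(weighting, 2, dim=0)

    # Prior-preservation loss, weighted with the prior half only.
    prior_loss = torch.mean(
        (weighting_prior.float() * (model_pred_prior.float() - target_prior.float()) ** 2).reshape(
            target_prior.shape[0], -1
        ),
        1,
    ).mean()

# Instance loss, weighted with the instance half only.
loss = torch.mean(
    (weighting.float() * (model_pred.float() - target.float()) ** 2).reshape(target.shape[0], -1),
    1,
).mean()

if args.with_prior_preservation:
    loss = loss + args.prior_loss_weight * prior_loss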

Fixes # (issue)

Before submitting

Who can review?

@sayakpaul

…target when using prior preservation (flux LoRA)
…target when using prior preservation (SD3 LoRA)
@github-actions bot added the examples and size/S (PR with diff < 50 LOC) labels on May 14, 2026
@Dev-X25874
Contributor Author

Hi @sayakpaul, would you mind taking a look at this when you get a chance?

The bug is present in both train_dreambooth_lora_flux.py and train_dreambooth_lora_sd3.py — when --with_prior_preservation is enabled, weighting is never chunked alongside model_pred and target, causing incorrect timestep weights to be applied to the prior loss term. The fix already exists in train_dreambooth_lora_flux2.py (line 1832), so this PR simply backports it to the two older scripts. Happy to make any changes if needed!

